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OUTPUT SUM OF TRANSDUCERS: LIMITING 
DISTRIBUTION AND PERIODIC FLUCTUATION 

CLEMENS HEUBERGER, SARA KROPF, AND HELMUT PRODINGER 


Abstract. As a generalization of the sum of digits function and 
other digital sequences, sequences defined as the sum of the out¬ 
put of a transducer are asymptotically analyzed. The input of the 
transducer is a random integer in [0,iV). Analogues in higher di¬ 
mensions are also considered. Sequences defined by a certain class 
of recursions can be written in this framework. 

Depending on properties of the transducer, the main term, the 
periodic fluctuation and an error term of the expected value and 
the variance of this sequence are established. The periodic fluc¬ 
tuation of the expected value is Holder continuous and, in many 
cases, nowhere differentiable. A general formula for the Fourier co¬ 
efficients of this periodic function is derived. Furthermore, it turns 
out that the sequence is asymptotically normally distributed for 
many transducers. As an example, the abelian complexity func¬ 
tion of the paperfolding sequence is analyzed. This sequence has 
recently been studied by Madill and Rampersad. 


1. Introduction 

Over the last decades, asymptotic properties of digital sequences 
have been studied by many authors. The simplest example is the g-ary 
sum of digits, see Delange j?]. This has been generalized to various 
other number systems (cf. jSS], [26], [33], [IT], [3], [H], [15], [22l, [20]). 
Similar results have been obtained for other digital sequences (cf. [6] 
and m)- Frequently observed phenomena in the asymptotic analysis of 
these sequences include periodic fluctuations in the second order term 
and asymptotic normality (see also il)- 
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The purpose of this article is to use finite state machines as a uniform 
framework to derive such asymptotic results. The results mentioned 
above will follow as corollaries from our main results, see the end of 
the introduction for more details. As an example of a new result fitting 
into this framework, we study the abelian complexity function of the 
paperfolding sequence (cf. |27]), see Example 

Our main focus lies on transducers: these finite state machines trans¬ 
form input words to output words using a hnite memory (see Section 
for a more precise definition). In our case, the input is the g-ary digit 
expansion of a random integer in the interval [0, N). We then asymp¬ 
totically study the sum of the output of the transducer for N ^ oo. 
This is also extended to higher dimensions. 

While some of the examples can easily be formulated by transducers, 
other examples are more readily expressed in terms of recursions of the 
shape 

(1) a{q'^n + X) = a{q'^^n + rx) + tx for 0 < A < g'' 

with fixed k, ka, fa G Z, € 1^ and kx < k. We transform such a 
recursion into a transducer in Theorem |4] in Section 12.61 

Several notions abstracting the sum-of-digits and related problems 
have been studied. One of them is the notion of completely g-additive 
functions a : No —?• M with 

a{qn -|- A) = a(n) -|- a(A) 

for 0 < A < g (cf. |1|). These have been generalized to digital sequences 
as defined in BE]: A sequence a(n) is a digital sequence if it can be 
represented as a sum Y2w /('^) ''^here / is a given function and w runs 
over all windows of a fixed length n of the g-ary digit representation of 
n. These digital sequences can easily be formulated by a recursion as 
in (0. 

For a transducer T, let T(n) be the sum of the output labels of 
T when reading the g-ary expansion of n. For a positive integer N, 
we study the behavior of T(n) for a uniformly chosen random n in 
{0,..., A^ — 1}. Assuming suitable connectivity properties of the un¬ 
derlying graph of the transducer, we obtain the following results. 

• The expected value is given by 

E(r(n)) = erloggAr + Ti(loggA^) + o(l) 

for a constant ej- and a periodic, continuous function \ki (The¬ 
orem . 

• The variance is 

V(r(u)) = Ur log, AT - v[/?(log,iV) + vI/ 2 (log,iV) + o(l) 

with constant vj- and a periodic, continuous function ^ 2 ( 0 ;) 

(Theorem [^. 
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• After suitable renormalization, T{n) is asymptotically normally 
distributed (Theorem]^. 

• The Fourier coefficients of Ti are given explicitly in Theorem]^ 
and the Fourier series converges absolutely and uniformly. 

• The function is nowhere differentiable provided that ej- is 
not an integer (Theorem]^. 

The exact assumptions for the various results are given in detail in the 
respective theorems. Results for higher dimensional input are available 
for expectation, variance, normal distribution and Fourier coefficients. 

Our theorems are generalizations of the following known results. 


For the sum of digits of the standard g-ary digit representa¬ 
tions (cf. [7]), we obtain an asymptotic normal distribution, the 
Fourier coefficients and the non-differentiability (for even[^g). 
The error term vanishes, as stated in Remark 3.4 Therefore, 


the formula is not only asymptotic but also exact. The formulas 
for the Fourier coefficients by Delange [7] also follow from our 
Theorem [2l 

The occurrence of subblocks in standard and non-standard digit 
representations is defined by a strongly connected, aperiodic 
transducer. Thus we obtain the expected value, the variance, 
the limit law and the Fourier coefficients (cf. |2^ l26l 114] for the 
expected value). For one dimensional digit representations, we 
also obtain the non-differentiability (assuming e-y ^ 0,1) of the 
fluctuation in the expectation. 

The Hamming weight is a special case of the occurrence of sub¬ 
blocks. Thus, Theorem[^is a generalization of the results about 
the width-tc non-adjacent form [20], the simple joint sparse form 
na and the asymmetric joint sparse form |2U) . 

A transducer defining a completely g-additive function consists 
of only one state. Therefore, we obtain an asymptotic normal 
distribution (as in 0 ), the Fourier coefficients and the non¬ 
differentiability (assuming er ^ Z and integer output). Here, 
the error term vanishes, too. 

A digital sequence is defined by a strongly connected, aperi¬ 
odic transducer. Thus, digital sequences are asymptotically 
normally distributed or degenerate. Assuming e-j ^TL and inte¬ 
ger output, the periodic fluctuation Ti(a:) is non-differentiable. 
The Fourier coefficients can be computed by Theorem See 
also [6] for results on the expected value. 

Automatic sequences [I] are also defined by transducers: The 
output labels of all transitions are 0 and the final output labels 


^Our approach in Theorem requires that the constant ej- of the main term of 
the expected value is not an integer. In this case, e-j- = which is an integer if 
q is odd. 




4 


CLEMENS HEUBERGER, SARA KROPF, AND HELMUT PRODINGER 


are as in the definition of such sequences. Theorem [T] gives the 
expected value with er = 0 (see also [29]) and, depending on 
the transducer, also the variance with ^ 7 - = 0. The Fourier 
coefficients of the periodic fluctuation of the expected value are 
given explicitly in Theorem 

• In [T7|, Grabner and Thuswaldner investigate the sum of digits 
function for negative bases S-q{n). They give a transducer to 
compute the function S-q{n) — S-q{—n). Their result about the 
limit law follows directly from our Theorem 


As an example of a new result obtained by Theorem we give an 
asymptotic estimate of the abelian complexity function of the paper- 

In |27|, the authors prove that this 
As consequences of Theo- 


folding sequence in Example 2.8 
sequence satisfies a recursion of type ([^. 


rem ^ the expected value is ~ ^ log 2 iV, the variance is ~ log 2 N 
and me sequence is asymptotically normally distributed. 

In the sequel, we discuss the relation of our setting and our results 
with the notion of g-regular sequences introduced in [Tj. 

A sequence is g-regular if it is the first coordinate of a vector v{n) 
and there exist matrices Vq, ..., Vq^i such that 


( 2 ) 


v{qn + e) = I 4 n(n) 


for £ e { 0 , 1 ,..., g — 1 }. 

The concept of g-regular sequences is more general than our setting, 
but a broader variety of asymptotic behavior is observed which pre¬ 
cludes any generalization of our results to general g-regular sequences. 

While T{n) is a g-regular sequence for any transducer T (see Re¬ 
mark 3.10), the converse is not necessarily true: Obviously, the sum of 


the output of a transducer reading the input n is always bounded by 
0{\ogn). However, the 2-regular sequence^ 


a{n) = 


n if n is a power of 2 , 
0 otherwise 


can clearly not be bounded by C>(logn). 

Asymptotic estimates for g-regular sequences are given by Dumas uni 
m- By restricting our attention to sequences defined by transducers, 
we obtain an asymptotic estimate of the variance, explicit expressions 
for the Fourier coefficients of the fluctuation in the second term of 
the expected value, non-differentiability of this fluctuation as well as a 
central limit theorem. 

Section contains all the theorems and the required notions. In 
Section Theorem]^ formulas for the first and second moment of the 
output sum of a transducer and its limiting distribution are presented. 
In Theorem in Section 2G the Fourier coefficients of the periodic 


^Use a(0) = (0,1)^ (where ^ denotes transposition), Vb = (0 1 ) D = (0 g) • 
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0 I 0 


1 I 0 
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Figure 1. Transducer computing the Hamming weight 
of the non-adjacent form. 


fluctuation \I/i(a;) of the expected value are stated. We discuss the 
non-differentiability of Ti(x) in Theorem]^ in Section 2.5 


Section 2.6 deals with sequences satisfying the recursion Q and 
higher dimensional analogues. We construct a transducer computing 
this sequence in Theorem]^ Thus, from Theorem[^ the expected value, 
the variance and the limit distribution follow in many cases. 

This construction and the computations for the constants 67 -, vj- 
and the Fourier coefficients can be done algorithmically by the mathe¬ 
matical software system Sage |3l]: The general framework is included 
in Sage version 6.4.1 using its finite state machine package described 
in |19) . The code for the Fourier coefficients and the construction 
from a recursion is submitted for inclusion in future versions of Sage, 
see http: //trac. sagemath. org/17222 and http: //trac. sagemath. 
org/17221, respectively. 

In Sections to we give the proofs of all the theorems from Sec¬ 
tion [21 


2. Results 

This section starts with the definition of some notions about the 
connectivity of a transducer. Then we will state the theorems about 
the moments and the limiting distribution, the Fourier coefficients, the 
non-differentiability, and the construction of a transducer computing a 
sequence given by a recursion as in Q. 

2.1. Notions. We consider complete, deterministic and subsequential 
transducers (cf. |5l Chapter 1]). In our case, the input alphabet is 
{0 ,... ,q — l}*^ for a positive integer d and the output alphabet M. A 
transducer is said to be deterministic and complete if for every state 
and every digit of the input alphabet, there is exactly one transition 
starting in this state with this input label. A subsequential transducer 
T (cf. [20]) is defined to be a finite deterministic automaton with one 
initial state, an output label for every transition and a final output 
label for every state. 

Figure presents an example of a complete, deterministic, subse¬ 
quential transducer. The label of a transition with input e and output 
6 is written as £ I 5. 
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The input of the transducer is the standard g-ary joint digit rep¬ 
resentation of an integer vector n G Nq, i.e. the standard g-ary digit 
representation at each coordinate of the vector n. The input is read 
from right (least signihcant digit) to left (most signihcant digit), with¬ 
out leading zeros. Then the output of the transducer is the sequence 
of the outputs of the transitions along the unique path starting in the 
initial state with the given input and the hnal output of the last state of 
this path. The element T(n) of the sequence dehned by the transducer 
T is the sum of this output sequence. 

Using hnal output labels is convenient for our purposes. Clearly, 
it would also be possible to model the hnal output labels by using 
an “end-of-input” marker and additional transitions. In the context 
of digital expansions, the behavior can usually also be obtained by 
reading a sufficient number of leading zeros. But the approach using 
hnal outputs is more general as it is not required that the hnal outputs 
are compatible with the output generated by leading zeros. 

For the various results, diherent properties of the complete, de¬ 
terministic, subsequential transducer and its underlying digraph are 
needed. All states of the underlying digraph are assumed to be ac¬ 
cessible from the initial state. Contracting each strongly connected 
component of the underlying digraph gives an acyclic digraph, the so- 
called condensation. A strongly connected component is said to be 
final strongly connected if it corresponds to a leaf (i.e., a vertex with 
outdegree 0) in the condensation. Let c be the number of hnal strongly 
connected components. We call a transducer or a digraph finally con¬ 
nected if c = 1. 

For the asymptotic expressions, only the hnal strongly connected 
components are important. All other strongly connected components 
only inhuence the error term. Thus, we are not interested in the peri¬ 
odicity of the whole underlying digraph, but in the periodicity of the 
hnal strongly connected components. The period of a digraph is de¬ 
hned as the greatest common divisor of all lengths of directed cycles of 
the digraph. For j = 1,..., c, let pj be the period of the hnal strongly 
connected component Cj. Dehne the final period of the digraph as 

p = \cm{pj I j = 1 ,... ,c}. 

We call a digraph finally aperiodic if p = 1. If the underlying digraph 
is strongly connected, its hnal period is equal to its period. 

For proving the non-diherentiability of the huctuation, we not only 
need a hnally aperiodic, hnally connected digraph (p = c = 1), but 
also a reset sequence. A reset sequence is an input sequence such that 
starting at any state and reading this sequence leads to a specihc state 
s. If the transducer is not hnally aperiodic and hnally connected, then 
there cannot exist a reset sequence. 
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2.2. Moments and Limiting Distribution. This section contains 
the theorem about the moments of the output sum T(n) and the lim¬ 
iting distribution. Further results about the periodic fluctuation can 
be found in Theorems [2] and |3l 

As probability space, we use = {0, 1 ,..., endowed with 

the equidistribution measure. 

Denote by $^^,(72 the cumulative distribution function of the normal 
distribution with mean /i and variance 7 ^ 0. Thus, 



Theorem 1. Let d>l,Tbea complete, deterministic, subsequential 
transducer with input alphabet { 0 , 1 ,..., g — 1 }'^, output alphabet M, 
final period p, and c final components. 

Then T(n) has the expected value 

(3) E(r(n)) = erlog,iV + Ti(log,iV) + C>(Ar-«logAr) 

where the constants e-j- and ^ > 0 are given in ([^ in Section \K^ and 
Ti(a:) is a p-periodic, Holder continuous function. 

If all bj given in (|^ are positive, the distribution function of'T{n) 
can be approximated by a mixture of c Gaussian distributions with 
weights Xj, means aj log^ N and variances bj log^ N for some constants 
Oj and Xj > 0 with 'YTj=i ~ 1? given m (|^. In particular, 


P 


T(n) 

yiog, N 





{x) + 0{\og 2 iV) 


for all a; G M. 

If all Oj are equal, then T{n) has the variance 

( 4 ) V(r(n)) = nr log, iV-T?(log,iV) + T2(log^iV) + 0 (iV-«log 2 iV) 

with constant v-j- gM. (given in and a p-periodic, continuous func¬ 
tion 4/2(0;). Otherwise, the variance is V(T(n)) = ©(log^iV). 

If all Oj are equal, T{n) converges in distribution to a mixture of 
Gaussian (or degenerate) distributions with means 0 and variances bj, 
weighted by Xj. In particular, if all bj > 0, 


P 


r(n)-E(r(n)) ^ \ 


^ Aj<ho,fe^.(x) -h C»(log 2 N) 
i=i 


holds for all a; G M. 

If furthermore c = 1 and vj- 7 ^ 0, then T{n) is asymptotically nor¬ 
mally distributed. 


We give the proof of this theorem in Section 
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0 I 0,1 I 0 



Figure 2. Transducer for Remark l2Tl 

Remark 2.1. The assumption that bj > 0 is essential for obtaining 
uniform convergence of the distribution function and the speed of con¬ 
vergence in particular. To see this, consider the transducer in Figure 
It is easily seen that T{n) = (—1)"'. For even N, the distribution 
function of T (n) / ^J\og 2 N is given by 

0 if a: < -l/^log2 AT, 

1/2 if - l/\/log2 < X < 1/^log2 N, 

1 if l/^log2 AT < X, 
which does not converge uniformly. 

2.3. Eigenvalues and Eigenvectors of the Transition Matrix. 

For the constants in Theorem [T] and the Fourier coefficients in Theo¬ 
rem we need the notion of a transition matrix of the transducer and 
properties of its eigenvalues and eigenvectors. 

We label the states of the transducer with contiguous positive inte¬ 
gers starting with 1. We denote the indicator vector of the initial state 
by ei. 

Definition 2.2. Let f G M be in a neighborhood of 0. 

The transition matrix for e G {0,..., g — 1}'^ is the matrix whose 
(■Si,S 2 )-th entry is if there is a transition from state si to state S 2 
with input label £ and output label 6, and 0 otherwise. 

Let M be the sum of all these transition matrices. 

Lemma 2.3. There are differentiable functions frj{t) in a neighborhood 
oft = 0 for j = 1, . .., c such that the dominant eigenvalues of M are 
exp(^^) in this neighborhood of t = 0 for some of the / G P = 
{k ^ Z, \ —pl2 < k < p/2}. For each of these dominant eigenvalues, 
the algebraic and geometric multiplicities coincide. For t = 0, Pj{0) = 

The proof of this lemma is given in Section 

Let I G Z. Consider the (not necessarily orthogonal) projection 
onto the direct sum of the left eigenspaces of M corresponding to the 
eigenvalues Pj{t) exp(^) for j = 1,..., c such that the kernel is the 
direct sum of the remaining generalized left eigenspaces. Let wj (t) be 
the image of ej under this projection, where ^ denotes transposition. 
The definition of wJ (t) only depends on I modulo p. 


p(^<. 
V \/log2 N 
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We write wj for wj (0) and Wi~^ for the derivative of wj (t) at f = 0. 
Furthermore, wJ is either the null vector or a left eigenvector of M 
corresponding to the eigenvalue g'^exp(^^). 

Let Cj be a final component with corresponding indicator vector Cj. 
Define the constants 




In Section 


3.1 


With these 


we will show that Xj > 0 and ~ 1- 

definitions, the constants in Theorem can be expressed 


as 


ttj = -iq Vj(0), 

C 

er = '^Xjttj, 
j=i 

(5) , /i'(0)2-gV;'(0) 

C 

vr = ^Xjhj. 
j=i 


Finally, ^ > 0 is chosen such that all non-dominant eigenvalues of M 
have modulus strictly less than q'^~^ at t = 0. 

These constants can be interpreted as follows: aj log^ N and bj log^ N 
are the main terms of the mean and the variance, respectively, of the 
output sum of the final component Cj. These expressions including 
the derivatives of the eigenvalues correspond to the formulas for mean 
and variance given in m Theorem IX.9]. The constants ej- and vj- 
are convex combinations of the corresponding constants of the final 
components Cj. 

The positive weight Xj in these convex combinations turns out to be 
the asymptotic probability of reaching the final component Cj. This 
is connected to the following interpretation of the left eigenvector : 
If the final period p is 1, the entries of will be shown to be the 
asymptotic probabilities of reaching the corresponding states. This 
corresponds to the left eigenvector used in a steady-state analysis. If 
p > 1, these probabilities depend on the length of the input modulo 
p. Then, we will prove that Wq gives the average of these probabilities 
taken over all residues modulo p. These interpretations are justified in 
Section o 


2.4. Fourier Coefficients. This section contains the formulas for the 
Fourier coefficients of the periodic ffuctuation Ti(a;). For this purpose, 
we need the following definitions. 

Let Yk = for /c G Z and 1 be a vector whose entries are all one. 
The s-th coordinate of the vector b{n) is the sum of the output of 
the transducer T (including the final output) if starting in state s with 
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input the g-ary joint expansion of n. In particular, the hrst coordinate 
of b{n) is T{n), and &(0) is the vector of hnal outputs. Furthermore, 
dehne the vector-valued function H(z) by the Dirichlet series 

(6) H{z) = J2b{n)\\n\\^\ 

n>0 

n/0 

where the inequality in the summation index is considered coordinate- 
wise and II ■ ||oo is the maximum norm. 


Theorem 2. Let T he a sub sequential, complete, deterministic trans¬ 
ducer. Then the Fourier coefficients of the p-periodic fluctuation 
are 

Co = - —j - iw'J 1-^3 Res, 2 =d w^H{z), 

dlogq d 

Ck = — - Res,=d+xk '^jH^z) 

“ + Xfc 

for k f 0. 

The Fourier series converges absolutely and uni¬ 

formly. 

The function wjH{z) is meromorphic in iR.z > d — 1. It has a 
possible double pole at z = d for = 0 and possible simple poles at 
z = d + Xk for k 0. 


The proof of this theorem is in Section 

The inhnite recursion given in Lemma |4.5| can be used to numeri¬ 
cally evaluate the Dirichlet series H{z) with arbitrary precision and 
to compute its residues ed. z = d -\- xi (see Lemma 4.7 and [IS]). For 
d = 1, the computation of the Fourier coefficients can be done by the 
mathematical software system Sage [SI] (using the code submitted at 
http://trac.sagemath.org/17222). 


Example 2.4. The (artihcial) transducer in Figure]^ has two hnal com¬ 
ponents with periods 2 and 3, respectively. Thus the hnal period is 6 
and the function 4/i(a;) is 6-periodic. The constant ej- of the expected 
value is In Figure the partial Fourier series with 2550 Fourier 
coefficient^ is compared with the empirical values of the periodic huc- 
tuation Ti, i.e.. 


(8) f E nn) - f log2 N 

n<N 


with integers N and 4 < log 2 < 16. 

The computation of these 2550 Fourier coefficients took less than 
6 minutes using a standard dual-core PC. 
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Figure 3. Transducer of Example |2.4 
final with final output 0. 


All states are 



Figure 4. Partial Fourier series compared with the em¬ 


pirical values of the function 4/i of Example 2.4 


In Example |2.8| we compute the first 2550 Fourier coefficients of the 
abelian complexity function of the paperfolding sequence. 

As a corollary of Theorem we obtain the following result which 
was already proved by Delange j?]. 


Corollary 2.5. The Fourier coefficients of the periodic fluctuation 

Ti(logg N) = ~ N 

n<N 


for the q-ary sum-of-digits function Sq{n) are 

q 

Co = 

(9) 


2 logg 


^ (log(2vr) - 1) - 




q-1 


-Cixk) 


Xfc(l + Xfc)logg 

for k 0 and Xk = where ( denotes the Riemann (-function. 
We prove this corollary in Section]^ 


^We use 2550 Fourier coefficients in this plot because the period length of the 
next summand of the Fourier series in Figure]^ is already less than the resolution 
of a standard printer. 
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2.5. Non-differentiability. In this section, we prove that for certain 
transducers, the periodic fluctuation \l/i(a:) of the expected value is 
nowhere differentiable. 


Theorem 3. Let d = 1. Assume that er ^ Z and that the transducer 
T has a reset sequence and output alphabet Z. Then the function tl/i(x) 
is non-differentiable for any a: G M. 


The proof can be found in Section There, we follow the method 
presented by Tenenbaum [32], see also Grabner and Thuswaldner im. 

In [3^ IT?] , the reset sequence consists only of O’s. If working with 
digit expansions, it is often possible to choose such a reset sequence. 
However, in the context of recursions, this is not always possible, see 
Example 2.8 There the reset sequence is (00001). 

For a general hnally aperiodic, hnally connected transducer, the ex¬ 
istence of a reset sequence cannot be guaranteed. 


2.6. Recursions. In this section, we describe how to reduce a recur¬ 
sion to a transducer computing the given sequence. All inequalities in 
this section are considered coordinate-wise. 

Let q > 2, K, Kx E Z, G Z'^, tx E M. and 0 < < k for 

0 < A < g'^1. If d > 2, then additionally let > 0 for all A. 

Consider the sequence a(n), n G Nq, dehned by the recursion 

(10) a{q'^n -|- A) = a{q'^^n Vx) +tx for 0 < A < g'^1 


and for all integer vectors n such that the arguments on both sides are 
non-negative. Furthermore, initial values a(n) for n E X have to be 
given for a suitable hnite set X C Mg. 

It must be ensured that the recursion (10) does not lead to conflicts 
and that the set of X is appropriate. Additionally, we require that X 
is minimal (with respect to inclusion). In that case, we say that the 
recursion is well-posed. 

In Section we construct a subsequential, complete, deterministic 
transducer T (also when the recursion is not well-posed) reading the 
g-ary joint expansion of integer vectors without leading zeros. We 
will dehne a distinguished subset of its states, called simple states. 
Furthermore, disjoint classes Xi, ..., Fk of integer vectors will be 
dehned. 


Theorem 4. The recursion (10) is well-posed if and only if 

(1) for each cycle consisting of simple states with transitions with 
zero input label, the sum of its output transitions vanishes and 

(2) the set X consists of one representative of each Fj, 1 < j < K. 
In that case, the sum of the output ofF is the sequence a, i.e., T{n) = 
a{n) for all n >0. 

The proof of this theorem is in Section Combining this result 
with Theorem yields an asymptotic analysis of the sequence a(n), as 
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in Example 2^ Moreover, this asymptotic analysis can be performed 
algorithmically in Sage for d = 1 (using the code submitted at http: 
//trac. sagemath.org/17221). A combinatorial description of the sets 


Fi involving an auxiliary transducer is given in Remark 6.1 


Remark 2.6. For d > 2, and ^ 0, the sequence cannot be computed 
by a hnite transducer; For every j > 0, there are non-zero integer 
vectors n > 0 , n' > 0 with n = n' (mod q^) —i.e., a hnite determin¬ 
istic transducer cannot distinguish between n and n' —such that the 


recursion (10) can be applied for the argument g'^n -|- A but cannot be 


applied for g^n' -|- A. 

This problem does not arise in the case of dimension d = 1\ if the 
end of the input is not yet reached (this is something the transducer 
knows), there is a guaranteed forthcoming digit > 1 (instead of 7 ^ 0 
in the higher dimensional case). This information is enough to decide 
whether the recursion can be used. 


Remark 2.7. Suppose that the given sequence is dehned for n > no for 
some constant Uq. Then the sequence b{n) = a{n + no) fulhlls (10) 
with Kx, Tx and tx replaced by q^>^s + r^J^ — no and respectively, 
where no -|- A = q^s + for 0 < M < g'^1. Then Theorem can be 
applied. 


Example 2.8. Consider the abelian complexity function p{n) of the 
paperfolding sequence. The paperfolding sequence is obtained by re¬ 
peatedly folding a strip of paper in half in the same direction. Then we 
open the strip and encode a right turn by 1 and a left turn by 0. The 
abelian complexity function p(n) gives the number of abelian equiv¬ 
alence classes of subwords of length n of the paperfolding sequence. 
Two subwords of length n are equivalent if they are permutations of 
each other. In BZI. the authors prove that this sequence satishes the 
recursion 



p(4n) = 

= 

), 



p(4 

a 2) = 

= p{2n 

+ 

1) 

+ 

p(16: 

fi 1) = 

= p{8n 

+ 

1) 

) 

p(16: 

n -|- 3) = 

= p{2n 

+ 

1) 

+ 

p(16: 

fi 5) = 

= p(4n 

+ 

1) 

+ 

p(16: 

rL + 7) = 

= pi2n 

+ 

1) 

+ 

p(16: 

n 9) = 

= p{2n 

+ 

1) 

+ 

p(16n 

+ 11) = 

= p(4n 

+ 

3) 

+ 

p(16n 

+ 13) = 

= p{2n 

+ 

1) 

+ 

p(16n 

+ 15) = 

= p{2n 

+ 

2) 

+ 
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1 I 0 



Figure 5. Transducer computing the abelian complex¬ 
ity function p{n) of the paperfolding sequence. For sim¬ 
plicity, the hnal output labels are omitted. 

with p(l) = 2 and p(0) = 0. The constructed transducer is shown in 
Figure]^ For simplicity, we do not state the hnal output labels in this 
hgure. The expected value and the variance are 

E(p(n)) = ^log 2 iV + v[/i(log 2 iV) + 0(iV-«logiV), 

432 

V(p(n)) = — log 2 N - T2(log2 N) + vl/ 2 (log 2 N) + 0(iV-« log^ N) 

with 0 < .^ < 0.5604267891, as the second largest eigenvalues of the 
transition matrix are —0.7718445063 ± 1.1151425080 h The sequence 
p{n) is asymptotically normally distributed. The functions Ti(x) and 
4 / 2 ( 0 ;) are 1-periodic and continuous. The reset sequence of the trans¬ 
ducer is (00001) (reading from right to left). The function Ti(a;) is 
nowhere differentiable and its Fourier series converges absolutely and 
uniformly. The hrst 24 Fourier coefficients of \t'i(x) are listed in Ta¬ 
ble In Figure]^ the trigonometric polynomial formed with the hrst 
2550 Fourier coefficients is compared with the empirical values of the 
function 4/i(a;) (see ([^). 
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1 

Cl 

1 

Cl 

0 

1.5308151288 

12 

-0.0002297481 + 0.0009687657 i 

1 

-0.0162585750 -h 0.0478637218 i 

13 

0.0006425378 + 0.0006516706 i 

2 

0.0054521982 -7 0.0075023586 i 

14 

0.0000413217 - 0.0003867709 i 

3 

-0.0028294724 -7 0.0086495903 i 

15 

-0.0005632948 - 0.0001843541 i 

4 

0.0036818110 0.0021908312 i 

16 

0.0009051717 - 0.0000476354 i 

5 

-0.0028244495 -h 0.0014519078 i 

17 

-0.0004621780 - 0.0000594551 i 

6 

-0.0008962222 -7 0.0030512180 i 

18 

-0.0000127264 - 0.0003100798 i 

7 

0.0015033904 -7 0.0013217107 i 

19 

0.0004112716 + 0.0001954204 i 

8 

-0.0006766166 - 0.0015392566 i 

20 

-0.0000011706 + 0.0004183253 z 

9 

0.0016074870 - 0.0000503663 i 

21 

-0.0001027596 + 0.0004091624 i 

10 

-0.0006908394 -7 0.0018753575 i 

22 

-0.0004725451 + 0.0004237489 z 

11 

-0.0008974336 -h 0.0007658455 i 

23 

-0.0000596181 + 0.0002323317z 


Table 1. First 24 Fourier coefficients of the abelian 
complexity function p{n) of the paperfolding sequence. 



Figure 6 . Partial Fourier series compared with the em¬ 
pirical values of 4/i(x) of the abelian complexity function 
of the paperfolding sequence. 


3. Asymptotic Distribution — Proof of Theorem [T] 

This section contains some lemmas which will together imply Theo¬ 
rem!^ Our plan is as follows: First, we give auxiliary lemmas about the 


eigenvalues and eigenvectors of the transition matrix M in Section 3.1 


Section |3.2 contains an asymptotic formula for the characteristic func¬ 
tion of the random variable T(n). We use this characteristic function 
to give formulas for the expected value and the variance in Section 3ffi 
and prove the continuity of the periodic fluctuations in Section |3.4 


Finally, we prove the central limit theorem in Section 3.5 

We use the notation (££,...£o)g for fhe standard g-ary joint digit 
representation of an integer vector with £i 7 ^ 0. For a real number in 
the interval [ 0 , g), we write (£o"^i • • •)? for the g-ary digit representation 
choosing the representation ending on 0'^ in the case of ambiguity. Fur¬ 
thermore, we use Iverson’s notation |T 8 ]: [expression] is 1 if expression 
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is true and 0 otherwise. All (9-constants depend only on q, d and the 
number of states. 


3.1. Transition Matrix and its Eigenvectors. This section con¬ 
tains the proofs of some results on the eigenvalues, eigenvectors and 
eigenprojections of the transition matrix M. 

For the proof of Theorem we use the following lemma which de¬ 
scribes the eigenvalues of a matrix in a similar way as the Perron- 
Frobenius theorem (cf. |13)). 

Lemma 3.1. Let M be a matrix with complex entries whose underlying 
directed graph is p-periodic and strongly connected. Then the set of non¬ 
zero eigenvalues of M can be partitioned into disjoint sets of cardinality 
p where each set is invariant under multiplication by and all 

eigenvalues in one set have the same algebraic multiplicities. 


Proof. Since the underlying directed graph of M E 
connected, p-periodic graph, we can write M as 


is a strongly 


/ 0 A2 


M = 


0 

As 


0 

VAi 


0 


0 \ 

0 

Ap 

0 / 


with block matrices Ai by reordering the vertices. Then M — xl is the 
product of the matrices 


/- 


-xl 

0 


0 


0 

V Ai 


and 


// 

0 


VO 


xP 


—xl 


xP 


0 


0 

n^=i A, 




xl / 


-^Ao 


0 \ 


I 

0 


A A, 


/ 


Let h(x) be the characteristic polynomial of 11^=1 Aj G Thus 

the characteristic polynomial of M is Therefore, the 

eigenvalues of M are either 0 or any p-th root of a non-zero eigenvalue 

ofULiA- □ 


With this lemma, we can prove Lemma 2^ about the eigenvalues of 
the matrix M: 











OUTPUT SUM OF TRANSDUCERS 


17 


Proof of Lemma 2.3[ First, consider the case t = 0. By construction, 
q'^ is an eigenvalue with right eigenvector 1 of M. As ||M||oo < 
where || • ||oo denotes the row sum norm, is a dominant eigenvalue. 

Consider the strongly connected components of the underlying graph 
of T. Each hnal strongly connected component Cj induces a hnal 
transducer Tj which is strongly connected, complete, deterministic and 
Pj-periodic. Thus, the adjacency matrix at t = 0 of this hnal transducer 
has a dominant eigenvalue q'^ with right eigenvector 1. By the Perron- 
Frobenius theorem (cf. [T^ Theorem 8.8.1]), all dominant eigenvalues 
of this hnal transducer are \ I E V with p \ Ipj}, each with 

algebraic and geometric multiplicity one. 

A non-hnal strongly connected component induces a transducer S 
with the adjacency matrix S. This transducer is not complete. Let S'^ 
be the complete transducer where loops are added to states of S where 
necessary. The adjacency matrix of is S^. Since (S’*" is complete, 
deterministic and strongly connected, p{S~^) = g'^. As S < S~^ but 
S 7 ^ S'+, Theorem 8.8.1 in [13] implies p{S) < p{S~^) = g'^. 

Thus, the dominant eigenvalues are with an / G "P such that 

there exists a j G {1,..., c} with p \ Ipj. We determine the geometric 


multiplicities of these dominant eigenvalues of M in Lemma 3.2 


Now, hx a hnal strongly connected component Cj and some I G V 
with p I Ipj. In a small neighborhood of t = 0, let pij{t) be the eigen¬ 
value of the submatrix of M corresponding to the c omplete transducer 
Tj with pij{0) = Because of Lemma 3.1 applied to the hnal 

component Cj separately, we have pij{t) = e^^^^pjit) where Pj{t) is 
dehned to be poj(t). 

All other moduli of eigenvalues of M are less than min^j \pij{f)\ be¬ 
cause of the continuity of eigenvalues. 

□ 


We prove the diherentiability of the eigenvalues in Lemma 3.2 


Lemma 3.2. Let Pj(t) exp(^^) be a dominant eigenvalue of the matrix 
M. There exists a corresponding left eigenvector of M with zero entries 
except in coordinates corresponding to the final component Cj. 

At t = the algebraic and geometric multiplicities of q^exp{^^) 
coincide. 

Furthermore the eigenvalues and the eigenprojection corresponding 
to the eigenvalues pj exp{^^) are analytic att = t). 


Proof. Let g‘^exp(^^) be a dominant eigenvalue of M. Its algebraic 
multiplicity at t = 0 is |{j : p | /pj}|. We construct exactly one left 
eigenvector in the neighborhood of t = 0 for each hnal component Cj 
with p I lpj\ Let Tj be the induced transducer of the hnal compo¬ 
nent Cj. Let {t) be a left eigenvector of the adjacency matrix of Tj 
corresponding to the eigenvalue exp(^^). As the algebraic mul¬ 
tiplicity is 1 in this hnal component, the choice of is unique up 
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to multiplication with a scalar function in t. Then, we construct the 
left eigenvector v~^ (t) by padding (t) with zeros. 

These left eigenvectors are linearly independent because of the block 
structure induced by the hnal components. Thus the geometric and 
the algebraic multiplicities of g'^exp(^^) coincide. 

Furthermore, exp(^^) is a simple eigenvalue of the adjacency 
matrix of 7j. Therefore, lai Chapter II] implies the differentiability of 
the eigenvalues and eigenprojections. □ 

From now on, we use the convention that the eigenspace correspond¬ 
ing to fijit) exp(^) is the null space if exp(^) is not an eigen¬ 
value. Then its eigenprojection is the constant null function. 

Definition 3.3. Let wjj{t) be the eigenprojection of ej onto the left 
eigenspace corresponding to the possible eigenvalue exp(^). The 
vector is thus a null vector or a left eigenvector of M correspond¬ 
ing to the eigenvalue exp(—). 

Dehne 

C 

w^{t) =e7 

lev j=i 

As an abbreviation, we write wjj, w'^, w'jJ and w'~^ for these pro¬ 
jections and their derivatives at f = 0. 

Remark 3.4. If there are only dominant eigenvalues, then w~^{t) = 0. 
This will imply that there is no error term in the asymptotic expansion 
of the expected value and the variance. This occurs in the case of 
the sum of digits of the standard g-ary digit representation and other 
completely g-additive functions because the transducer has only one 
state. 


Lemma 3.5. In a fixed neighborhood oft = 0, let ^ > 0 be as defined 
in ([^, he., all non-dominant eigenvalues have modulus less than . 
Then 






'm 


for m, k > 0 and a constant 


Proof. Let P be the matrix such that h-)■ x~^P is the sum of the 
eigenprojections onto the left eigenspaces corresponding to Hj exp(^^) 
for j = 1, ..., c and I G P. Then w'^ = ej{l — P) and 

= ej{{l - P)M)^. 

As the spectral radius of {I — P)M is less than g'^“^, we obtain the 
stated estimates. □ 
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With wj defined in Section |2.3l we have 


(11) 


i=i 


Note that left and right eigenvectors corresponding to different eigen¬ 
values annihilate each othe r. Because of the block structure of the 
eigenvectors in Lemma 3.2 and because 1 is a right eigenvector to 
we have 


( 12 ) 


|; = 01A_i = wjji 


where Xj is defined in Section 2.3 Furthermore, w^l = 0 and 

j=l lav j=l 

Denote by 5 the vector whose s-th component is the sum of the 
outputs of all transitions leaving the state s. By the definition of the 
transition matrix M{t), 5 can be expressed as 


(13) 


iS = 

at 


t=o 


We now establish a relation between 5, the left eigenvector wJ and its 
derivative at f = 0. By definition of the left eigenvectors and 

( 0 , 

wJ{t)Ml = ^/ij(t)exp(^— 

j=i ^ 

Differentiation, (12), ([^ and ( pfi| yield 
(14) ^ ~ g^(e^ — 


To establish the interpretation of Wq given at the end of Section 2.3 
we consider 

wl ■= lim 

m—^oo 

the stationary distribution on the state space of all states of the trans¬ 
ducer under the assumption that the input length is congruent to k 
modulo p. Using © and Lemma [3.5| yields 


wl = lim 

m^oo 


E 

l£V 


w 


/'27rilk\ 


j^mp+k ^-d{mp+k) 


lim Vexp (- )wj + 

n^oo \ p / 


l£V 


= 0exp 
l€P 


/'2TTilk\ 


\ p J 


r^i 












20 CLEMENS HEUBERGER, SARA KROPF, AND HELMUT PRODINGER 


Summation leads to ^ Y^^k=o'^k = '*^0 • Thus, Xj is the hitting proba¬ 
bility of the hnal component Cj when starting in the initial state. As 
every state is accessible from the initial state, Xj is positive. 

67 -, which can be inter- 


Finally, for I = 0, (14) reads q 




Wq6 = 


preted as the steady state analysis of the expectation; the probability 
distribution is multiplied with the expected output q~'^5. 

3.2. Characteristic function. To obtain a central limit law in Sec¬ 


tion [3^ we compute an asymptotic formula for the characteristic func¬ 
tion in this section. 

The next lemma can be proved by induction on L. It is a general¬ 
ization of Lemma 3 in 


Lemma 3.6. Let A^, e = 0,..., g — 1 he matrices in H^ '. No —?• 

£nxn known functions with iLo(O) = 0. Let G: No —>■ he a 

function which satisfies the recurrence relation 

GiqN + e) = A,GiN) + H,{N) 

for > 0, e e {0,..., g — 1} and G(0) = 0. Then 


The solution of this recursion hnally leads to an asymptotic formula 
for the characteristic function. 

We choose the branch — tt ^ < arg^; < vr ^ of the complex 
logarithm. After setting f = 0, we use only the logarithm of complex 
numbers for which our branch coincides the principal branch —tt < 
arg2; < tt. 

Lemma 3.7. The characteristic function of the random variable T{n) 
is 


G {{El . . ■£o)q) — ^ 


n / 


E(exp( 2 fT(n))) = 






logoN 


l&V j=l 


exp 


2nil logq N 
P 


Tzi(log„ N,t) + R{N, t) 


with functions '^ij{x,t) (defined in (24)), which are arbitrarily often 
differentiable in t and 1-periodic in x, and an error term R{N, t). This 
error term R{N, t) is arbitrarily often differentiable, too, and satisfies 
^R{N,t) = N~^\o^ N), for k > t), a constant and the 

constant f > 0 defined in Section \2.3[ in a neighborhood oft = 0. At 
t = 0, we have R(N, 0) = 0. 


Proof. For a transducer T, consider the characteristic function 
(15) F{N) = E(exp(!«r(n))) = fj Y, 
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of the discrete random variable T{n). 

Then the snmmands in ( |I5| ) can be expressed as a matrix prodnct 

L 

e**r{^) = eJY[M,^u 
1=0 

where {el ■ ■ ■ £o)<j is the standard g-ary joint digit representation of n 
with £l 0 and the vector u has entries where b{s) is the final 
ontpnt of the state s. Again, the vector ei is the indicator vector of 
the initial state. 

Let 

L 

9{n) = 

1=0 

and 

G(A') = 5^ ain), 

ngOjv 

hence 

(16) F(N) = ielG(Af)ti. 

The fnnction g{n) satisfies the recnrsion 

(17) g{qn + e) = Meg{n) 

for £ e { 0 , 1 ,..., g — 1 }'^, n > 0 with gn + £ 7 ^ 0 . 

We define fnrther fnnctions 

(18) Gc(N)= Y. EsW 

0<ni<W ni=N 

i^C i^C 

where the coordinates ni, Ud of n with indices in the set C C 
{1,..., d} are fixed to N. This yields G{N) = G(ij{N). Fnrthermore, 
we define the matrices 

q—l £—1 

(19) 

ft=0 /3i=0ft=£ 
i^CUD ieD iec 

for disjoint sets G, D C { 1 ,..., d} and £ G {0,1,..., g — 1}. In this 
definition, we restrict the Tth coordinate f3i of f3 to he e or less than £ 
if i E G or i E D, respectively. Otherwise, the Ath coordinate can be 
arbitrary. Then, M = holds independently of £. 
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Then, (17) yields the following recursions for Gc{N), e: = 0, ..., 
g — 1, > 0 and C {1,... ,d}: 

( 20 ) 

g-1 

Gc{qN+ e) = 9iqm + f3) 

/3i=0 0<qmi+l3i<qN+e qmi+l3i=qN+e 
i^C i&C i^C i&C 

= [c = 0 A giV + £ ^ 0](J - Mo) 

0<mi <N+ 

t^C I&C - 1 I&C 

i^C 

= [G = ^AqN + e^0]iI- Mo) + M^c,dGcMN). 

DCC<^ 

This recursion for Gq only depends on Gq' for G' 2 C. As 

G(i.d|(A')=9(A'l). 


we can recursively determine Gc using Lemma 3^ In particular, for 
G{N), this yields the recursion formula 

(21) G{qN + e) = MG{N) + H,{N) 

for > 0, e G {0,... g — 1} where are known functions with 

(22) M,(iV) = [giV + £^0](/-Mo)+ MyGoiN). 

07^DC{l,...,d} 


Thus by Lemma 3.6, we get 


(23) 


G((£i ... £„),) = ({e,... £™+i),), 


m=0 


By construction, ||Me||oo = 1 for every e G {0, ...,g — 1}'^. We 
conclude that ||M^^^||oo < qd--\G-\D\^\D\^ gy definition of Gc{N), 
the growth rates of the functions Gc{N) and H^{N) are ||Gc(A^)||oo = 
( 9 (Ar'^-|C'|) ||i7g(A^)||oo = 0{N'^~^), respectively. For k > 0, the k- 

th derivative of H^{N) at t = 0 can be bounded by 00^^N'^~^ log*^ N) 


for a constant 
We define 




m=0 


which constitutes an explicit expression for the error term contributed 
by the non-dominant eigenvalues. By Lemma [375| its derivatives satisfy 
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for /c > 0. Because ^(O) = 1 and left and right eigenvectors correspond¬ 
ing to different eigenvalues annihilate each other, we have R{N, 0) = 0. 
By (@, Q and ej = wj- + w^, 

/27iilL\ 




lev j=i 


m=0 

+ R{N,t) 




V p 

27ril{m — L) 
P 


) ■ ■ ■ £m+l)q)'U- 




log„N 


jyd ^ ^ o 
ieP i=i 


exp 


27ril log„ N \ 

--^jvI/,^.(log^iV,t) + i?(iV,t) 


with 


(24) t) = pj{t) exp 

■ £hiW"™exp ... Xm-l)q)u 

m=0 P 

and = {xq . Xi.. .)q, choosing the representation ending on 0^ in 
the case of ambiguity. 

The functions ^ij{x,t) are periodic in x with period 1 and well de- 
hned for all x G M since they are dominated by geometric series. Fur¬ 
thermore, they are arbitrarily often differentiable in t. □ 


3.3. Moments. In this section we give the moments of the output sum 
Tin). 


Lemma 3.8. The expected value and the variance ofT{n) are as stated 
in Theore m [ij with constants given in (|^ and periodic functions given 
in Lemma 3.9 and (28). 


Proof. The derivative of E(exp(itT(n))) with respect to t at t = 0 gives 
the expected value of the sum of the output of the transducer 


E U") = «’o(log,/V)log,/V + 4'i(log,/V) 

+ C>(Ar-«logA^) 


with p-periodic functions 

C 

^^ ^ 2 'Kilx 

= ^^aje p d'zj(x,0), 

i&r j=i 

C 

\—^ X—^ ^TT^Za: . 

p d'zj(x,0) 

lev j=i 


( 25 ) 
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and constants aj defined in (|^. Here, denotes the derivative with 
respect to t. 

We now compnte \l/o( 2 :) for some x with = (xq . Xi.. .)g. To 
compnte H^(N), we nse ( [2l] ) and the dehnition of G{N) to obtain 

(26) H,{N)1 = {{qN + - iqN)'^)l 

for t = 0, becanse 1 is a right eigenvector of for every s. Together 


with (24), this resnlts in 
T,,(x, 0 ) = g-"fo> 

with 


exp 


27ri/{x} 


P 


w'lDiq^e 


D{z) = ""((xo . . . Xm)q -{xq... Xm-lO)J). 


m=0 


By (12), we have '^ij{x, 0) = 0 for / 7 ^ 0. 

To compnte D{q'^), observe that 

00 

((a^O -Xi... Xm)g - (xo . Xi . . . Xm-l)q) 

m=0 

= lim (xo . Xi . . . Xm)q = 

becanse D{q'^) is a telescoping snm. 

We conclnde that 


(27) 

and therefore 


Tz7(x,0) = \j[l = 0] 


'^' 0 ( 2 :) = = er 

i=i 


by ([^. This completes the proof of the expectation as given in ([^. 

Using Lemma 3.7 and (27), the second derivative of E(exp(itT(n))) 
gives 

‘ 5 ^ T(nf = 


N<i 


c 

fogg + vr logg N 

i=i 

/27ri/fog iV\ 

- 2* logg exp -j (fogg N, 0) 

i&r j=i ^ 

+ T 2 (fog,W) + C)(iV-«log2 N) 

with Vr given in (M and 


(28) 


V V X—^ X—^ 2'nilx ,, , 

^2{x) = -Y,Y.^ 

zeP j=i 
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Here, denotes the second derivative with respect to t. Thus, by ([^, 

the variance is 

(29) 


''(UU) = ^ 


E T(n?- E Tin) 

TIGQn \ 7l£ft]S[ 


i=i 


v-j — 2i EE ttj exp 
i&r j=i 


2Tiil logg N 
P 


%ilog^N,0) 


- 2erd'i(log, N)^ ^og^N 

+ T2(log^iV) - T2(log^iV) + 0{N-^\og^N). 

By Jensen’s inequality, the coefficient of log^ N is zero if and only if 
all Oj are equal. If all Oj are equal, then the coefficient of log^ N in (29) 
simplihes by (25), too, and we obtain (|^. □ 

For the computation of the Fourier coefficients and the proof of the 
Holder condition, we need an explicit expression for Ti. 

In analogy to the dehnition of Gc in (jl^, dehne 

(30) Bo(A')= E E'>(”) 

0<ni<N ni=N 
i^C iGC 

for C C {1,..., d}. 

Lemma 3.9. For = (xq .xi...)q, the fluctuation Ti(a;) can be 
expressed as 


(31) '^i{x) =—er{x} — q EE q ^'^fi{{xo...x^)q) 

with 


zg-p m=0 


fi{r) = [I = 0]er{[\oggr\{r‘^ - {q[rq ^\Y) + {q[rq ^J)'^) 

'2'Kil' 


(32) 


iw'^ 1 ( r'’' — exp 


P 


{q[rq ^J)'^ 


-w]B^{r) + q^eyip{^^^wjB^{[rq ^J). 


The estimate fi{r) = 0{r^ ^logr) holds. 

Proof. From (25), (24), (|^, (27) and (12) and the absolute convergence 
of '^ij, we obtain (31) with 


fi{r) = [I = 0]erLloggFj(r'^ - {q[rq ^J)”^) 
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+ i Hr mod q{[rq ^J)w(t) 


t=0 


From the combinatorial interpretation of b{n) and g{n)u{t), we ob¬ 
tain 


(33) 


ib{n) = ^g(n)u(t) 


t=0 


in analogy to (13). As the range of summation of Gc and Be coincides, 
we immediately get 


(34) 


iBc{N) = jGc{N)u(t) 


f=0 


By (26) and by differentiating He{N)u{t) using (21), (34) and (13), 

fi{r) = [l = 0]erLloggrJ(r'^ - 

-I- — {q[rq~^\Y) 


w 


J (B»(r) - AfB»(Lr«-‘J) - N"')"'*)- 


The fact that wj is a left eigenvector of M and (14) establish (32). 

For the growth estimate of fi{r), we use the explicit dehnition of 
in (22), (34) and the trivial estimate ||h(n)|| = C>(log ||n||). □ 

To formulate T(n) as a g-regular sequence, we hrst define output 
vectors. The s-th entry of the vector 6^ is the output label of the 
transition from state s with input label e. By ([T^, ([M|), and 


(35) 

we have 

(36) 


d 


t=0 


= 


h{qn + e) = M^b{n) + 5^- 


Remark 3.10. We can use the matrices 

/M, 6e [s = 0]A 

14 = 0 1 0 

\ 0 0 [e = 0]// 

andn(n) = (h(n), 1, [n = 0](h(0)— Moh(O) — <5o))''' in the dehnition of a 
g-regular sequence (|^ to realize that the output sum of a transducer is 
g-regular. If d > 1, then this is a multidimensional g-regular sequence 
(cf. H). 

3.4. Holder Continuity. In this section, we prove the continuity of 
the huctuations 4/i and 4/2 as well as the Holder continuity of 4/i. This 
will be used to establish the convergence of the Fourier series. 


Lemma 3.11. The functions 4/i(x) and, if all aj are equal, 4/2(t) are 
continuous for a; G M. 
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Proof. First note that continuity of \I/i for x G M with x = log y 


where y has no hnite g-ary expansion follows from the dehnitions (24) 


and (25). To prove it for x = log^ y with 0 < x < p where y has a hnite 
g-ary expansion, observe that the two one-sided limits exist due to the 
dehnition. Next, we prove that they are the same. Consider the two 
integer sequences N]^ = yq^^ and iV^ = — 1 for k large enough such 

that Nk is an integer. For a real number z, we write {z}p = p{z/p} for 
the unique real number in the interval [0,p) such that z — {z}p is an 
integer multiple of p. 

This yields 

lim {logg Nk}p = lim {log^ y pk}p = {x}p = lim { 2 ;}^, 


k^oo 


k^oo 




lim {log Nk}p = lim (log Nk + log (1 - iV/)} 


k^oo 


k^oo 


= lim {x -F logq(l - Nj_, ^)}p = lim_{^} 


k^oo 


P' 


If we insert the two sequences Nk and Nk in 

Tin) = erN'^logqN + N’^^iilog^N) + OiN'^-^ logN) 

(cf. ([^) and take the difference, we get 

OiNt^ logiV,) = iV,"vl/i(log,iV,) - iV,^vl/^(log^iV,) + OiNt^logNk). 
Because Ti(x) is bounded by a geometric series by dehnition, we have 
^lilog^Nk) - ^lilog^Nk) = OiN-^logNk) 
and in particular 

lim d'ldlog iVfcjp) = lim Ti({log iVfcjp). 

k^oo k^oo 

Therefore, Ti is continuous in x. 

The continuity of 4/2 (x) at x = logq(g) for y with inhnite g-ary 
expansion again follows from the dehnition of 4/2. If all Oj are equal, 
the continuity of the huctuation —4/{ -F 4)'2 of the variance (|^ follows 
as above, where log Nk has to be replaced by log^ Nk in the error terms. 
Thus T 2 is also continuous in this case. □ 


Lemma 3.12. The function 4/i satisfies a Holder condition of order a 
for all a G (0,1). 


Proof. Let 0 < a < 1 be any constant. We want to prove that there 
exists a positive constant C such that 

(37) |4/i(p)-4>i(x)| ^CIp-xI^ 


holds for all X, p G M. 

For X = y, the left-hand side of (37) is 0 and the inequality is obvi¬ 
ously satished. From now on, assume that x < y. By the periodicity 
of 4/i, it is sufficient to prove (37) for 0 < x < p. 
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First, we prove (37) for the case Q < x < y and sufficiently small 
y — X <1. 

Fix such X and y and choose the integer k such that 


q 


-k-l 


< <q 


-k 


Note that the continuous differentiability of z ^ on the compact 
interval [0,p + 1] implies that = 0{\y — x|) and therefore 

(38) q->^ = 0{\y-x\). 


We prove (37) in three steps. 

Statement 3.13. Let a, b eM. with x < a < b < y and [aj = [6J such 
that the first k + 1 digits of the expansions 

gW = (oo . oi.. .)g, = (bo-bi.. .)q 

coincide, i.e., Oi = bi for 0 < i < k. Then 

\^,ib)-'^^{a)\=0{\y-xn. 

Proof. Lemma 3.9| yields 

I'tiW-'l'iWI < lerllW-WI 

■EE q-^^mbo...bJq)-M{ao...am),)\ 


Zg'P m>0 

-.W|EE 

ZeP m>0 


\q 


<KII{ 6 }-W| 

+ EE V''”(l/<((''o... 6 „),)| + |/,((ao...a„),)|) 


-.W|EE 


lev m>k 

+ \q' 

lev m >0 

because the summands for m < k cancel in the first sum as the first 
k + 1 digits coincide. By using the estimates 

IW-WI<IW-Wr = 

_g-dW| = c>(|6-a|"), 
mbo...b^)q)\ = 0{q^‘^-^^^m) 

(see Lemma 3.9 for the last estimate), we obtain 

|^i(6) -^i(a)| = OUb-a\^+ ^mq-^ P\b-a\A 

m>k ' 

= 0{\b - a|“ + kq-^) = 0{\b - a|" + g-“^) 

= 0{\b - ar + \y- xr) = 0(\y - x|“). 


Here, (38) has been used in the penultimate step. 


□ 
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We now use the continuity of \l/i and Statement 3.13 to remove the 


condition on coinciding digits from Statement 3.13 


Statement 3.14. Let a, 6 G M with x < a < b < y and [aj = [foj. 
Then 

\^i{b)-^i{a)\=0{\y-xn. 

Proof. We write the expansions of and as 

= (ao . ai.. .)g, = {bo . bi 

This yields 


• -Jq- 


0 < q 


{b} 


q{a} = 


1 


q 


[aJ 


{q^ -q^) <q^ -q^ <q^ -q^ <q 


-k 


Thus 


0 < {bo...bk)q - {ao...ak)q < 1 . 

If {bo - ■ ■ bk)q = (ao ■ ■ ■ ak)q, the result follows immediately from State¬ 
ment 3.13 Otherwise, we have 

(39) (bo ■ ■ ■ bjfjq (Oq . . . ttffjq 1. 

For m > 0, dehne 2 ; and Zm by [z\ = Yzm\ = [aJ = [&J and 

= (bo.bi... bk)q, 
qPrn} = (ao . ai... ak{q - l)"")q- 


Then lim„ 


,Zm = z because of (39). 


By construction of 2 ; and 2 ;^, we have a < Zm < z <bioi sufficiently 
large m. 

By continuity of Ti, 

(40) \'^i{z)-'^i{Zm)\<\y-x\°‘ 

holds for sufficiently large m. 

This yields 

|Ti(6) - Ti(a)| < |Ti(6) - Ti(z)| + |Ti(z) - Ti(2;,„)| 

+ |fE'l(^m) ~ ^l(®)|- 


The third summand can be bounded by Statement 3.13 (for a and 
Zm) and the second by (40). The hrst summand is either 0 or can be 
bounded by Statement 3.13 (for 2 ; and b). □ 


To hnally prove (37) for sufficiently small y — x < 1 , we only have 
to remove the assumption [aJ = [6J from Statement 3.14 We use the 
idea of the proof of Statement |3.14| once more. 

Assume that [y\ > [xj. By our assumption y < x + 1, this amounts 
to [i/J = [xJ -|- 1. For m > 0, dehne z and by 2 : = [y\, [_Zm\ = L^J 
and = {{q — 1) . (g — l)™')^. Then lirnm^-oo Zm = By continuity 
of Ti, we have 


(41) 


|Ti( 2 :) - '^lizm)] <\y-x\ 
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and X < Zm < z <y ioT sufficiently large m. 

Then, this yields 

|Ti(i/) - Ti(a;)| < |Ti(i/) - Ti(^)| + |Ti(;2) - 

+ I'^iizm) - d'i(x)|. 


The third summand can be bounded by Statement 3.14 for x and Zm 
and the second by (41). The first vanishes or can be bounded by 
Statement 3.14 for z and y. 

This yields 

|^i(l/) - =0{\y-x\^). 


Therefore, (37) is satisfied with a suitable positive constant C for y — 
X < e ioT some e > 0. 

Assume y — x>e. As 4 /1 is continuous and periodic, |4/i(?/) — \['i(x)| 
is bounded. Thus, (37) holds for a suitable positive constant C for 
\y — x\ > e. 

Therefore, the function 4/i is Holder continuous of order a < 1. □ 


3.5. Limiting distribution. Finally, we can prove the parts of The¬ 
orem concerning the approximation of the distribution function and 
the central limit theorem. 


Proof. To prove that the distribution function can be approximated by 
a Gaussian mixture, we use the Berry-Esseen inequality (cf., for in¬ 
stance, m Theorems IX.5]) to estimate the difference between distri¬ 
bution functions. The proof follows the proof of Hwang’s Quasi-Power 
Theorem |23j. First, we describe the two corresponding characteristic 
functions. 

Let gN{t) be the characteristic function of a mixture of Gaussian or 
degenerate distributions with weights Xj, means Oj y/logg N and vari¬ 
ances bj for j = 1,..., c, that is 


9N{t) = Xj exp (iaNt - 


with ttj, bj an d Xj defined in ([^. 


By Lemma 3.7, the characteristic function /Ar(t) of T (n)/-^/logg N 


IS 


Mt) = {ia,^log,Nt - + 0(-x=) 

j = l V 6 




i&v 


yiogTiv 


+ R[N, 


y/\ogN 


_ 1 

for t\ogq ^ N in a fixed neighborhood of 0. 
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Because of 


and R{N,0) = 0 (see Lemma 3.7), we have 


hit) = 5]]exp (iajJlogqNt - exp (o 


i=i 

'a, + O 


A/log iV 


t 


+ o(N-k^/\ogN 


Now we use the inequality |e"' — 1| < valid for all complex 

numbers w, to obtain 


(42) -{fN{t) - gN{t)) 


j=i 


bi 9 

+ 0{ , 

2 V A/log N 


0{N-^ \og-"^ N) 


for flogg ^ in a small neighborhood of 0. 

From now on, we assume that hj ^ 0. There is a small neighborhood 
_ 1 

of 0 for t logg ^ N such that 


0{ exp - + 


VlogiV / 


= 0(exp( 


holds. 

This yields 

1 
t 


{hit) - gN{t)) = ^C>(^exp (^-+C>(iV ^ logs AT). 

j = l V 6 

Now, the Berry-Esseen inequality with T = cydog)jlV for a small 
constant c > 0 (cf., for instance, [T^ Theorem IX.5]) implies that 

sup \Fn{x) — Gn{x) \ = of 




hogN 


where Fjy is the cumulative distribution function of T(n) and Gat is 
the cumulative distribution function of the mixture of Gaussian distri¬ 
butions. 

If all Gj are equal and bj > 0, Gn is the distribution function of a 
mixture of normal (or degenerate) distributions with mean e-j-^^log)^ 
and variances bj > 0. After subtracting the mean, (42) converges to 0. 
Thus, 

r(n) -E(r(n)) 

V^ogg N 

converges in distribution. If all bj > 0, then the same estimates as 
above yield the speed of convergence. □ 

This completes the proof of Theorem 
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4. Fourier Coefficients — Proof of Theorem [2] 

This section contains the proof of the theorem about the Fourier 
coefficients. First, we investigate some Dirichlet series which we will 
use later. Then, we prove the formulas given in Theorem We use 
the Holder condition for Ti to prove that its Fourier series converges. 


Lemma 4.1. The Dirichlet series 

L{.z) = ^ [logg d - ^Y)r~^ 


r>l 


is meromorphic in iR.z > d — 1 with poles in z = d+ for / G Z. The 
main part at z = d is 


d 


d 


{z — dy\ogq 2{z — d) 

and, for I ^ 0, the residue at z = d + is 2 ^- 
Proof. First, we use the binomial theorem to obtain 

(43) L{z) =dL,{z-d + l)-^ {-ly-^Hz - j) 

j=o 

with Li = X]r>iDirichlet series Li{z) is holomorphic 
for iRz > 1. Thus, the second summand in (43) is holomorphic for 
iR.z > d — 1. To obtain the expansion of L{z) at 2 ; with 'Rz = d, we 
investigate the Dirichlet series Li[z) at ^z = 1. 

Let /c > 0 be an integer. We use Euler-Maclaurin summation with 
f{x) = kx~^ to obtain 

„fc+i 




[logg r J 


qk^j.^gk + 1 


kx-^ dx - 

yk 2 


nk + 1 


— kz 
1 

1-z 


/ Bi{{x})x ^ ^ dx 

I qk 

(^kqp+m-p _ kq^i^-P) 


- - kq-’^^ 

^gk + l 


— z 


Bi({x})x ^ [log (a;)J dx 


where Bi(x) is the first Bernoulli polynomial. For ?R:z > 1, summation 
over k > 0 yields 

LiO) = rf-E - 1) - fc) 


k>l 


k>l 
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/ OO 

Bi{{x})x~''~\logg{x)\ dx 
11 11 

= z _ 1 qz-i _ 1 + 2^^ “ ^ / 5i({x})a;-^-Hlog,(x)J dx. 

The second summand and the integral are clearly holomorphic for > 
0. Thus, Li{z) can be continued meromorphically to ^z > 0 with poles 
coming from the hrst summand. 

The expansion around z = 1 is 


z — 1 — 1 


+ 0 ( 1 ) = 


(z — ly \ogq 2(z — 1) 


+ 0 ( 1 ). 


Thus, by (43), we obtain the main part and the residues of L(z) at 
z = d + for / G Z as stated in the lemma. □ 

logg 


Lemma 4.2. The Dirichlet series 

Z{z) = ^(r-^ - (r - lY)r-^ 

r>l 


is meromorphic in C with simple poles in z = j, j G {1,..., d} with 
residues J (—1)'^~T 

Proof. The binomial theorem yields 

i=o 

where ( is the Riemann (^-function. The result follows from the unique 
pole of ({z) ed. z = 1 with residue 1. □ 


Denote by ^(^, 0 ;) the Hurwitz (^-function 

C{z,a) = {r + a)~\ 

r>—a 

Furthermore is the digamma function. 


Lemma 4.3. For 0 < a < 1 and and an integer 0 < j < d — 1, the 
Dirichlet series 

= ^ + (r + a)"^ 

r>l 

is analytic for IR.z > j + 1. For j = d — 1, it is meromorphic for 

iRz > d — 1 with a simple pole at z = d with expansion 

(44) 

J{z, a, d — 1) =- - — y{a + [a = 0]) — [a > 0 A d = l]a“^ 



i-aY-^-^C{d-k,a) + 0{z 


d). 
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Proof. As r^{r + a)~^ = J is analytic for 'Siz > j + 1. Now, 

let j = d — 1. 

The binomial theorem yields 


J{z,a,d — 1) = '^^{r + a — a)‘^ ^(r + a)" 


r>l 

d-1 

E 

k=0 

d-1 

Z 

k=0 


d-1 


d — 1 


-a 


,d—l—k 


^(r + 


r>l 


(—a)'^ ^ ^{C,{z — k,a) — [a > H]a 


d-2 


— C{z — d + 1, tt) + ^ ^ 


k=0 


d-1 


(-ay ^ ^((z-k,a) 


— [a > 0 A d = l]a 
Using the expansion (cf. jMl P- 271]) 


C(z,a) = 


z-1 


'ip(a + [a = 0]) + 0(z — 1) 


yields (44). 


□ 


Lemma 4.4. Let fc G Z. The Dirichlet series 

OO 

B(z) =w'l^ (B^(r + 1) - 2B^(r) + B^(r - 1)) 


r=l 


is analytic for 'Liz > d — 1. 


Proof. By the definition (30), we have 
(45) B^ir + 1) - B^ir) = ^ Bc{r), 

which can be bonnded by ||Sc(r)|| = 0{r'^~^\ogr). Thns, 

B(z)=wJ 5^(-Bc(r) - Sc(r - l))r-^ 

0^C'C{l,...,d} r>l 

which converges for Liz > d — 1 by O Theorem 8.1]. 

The vector-valned fnnctions Hc(z) are defined by the Dirichlet series 


□ 


(46) 


ifc(=) = 5^Sc(r), 


r>l 


By ([^ and ( jisj ), this yields 
(47) H(z) = Hc(z) = 


^(S8(r + 1) - Bt(r))r 

07 ^C'C{l,...,d} r>l 

Next, we investigate the Dirichlet series He- In particnlar, we de¬ 
termine its behavior oX z = d + Xk and provide an infinite fnnctional 
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equation to compute its residues at these points. This will hnally give 
us the residues of iT in Q. We use a similar method as Grabner and 
Hwang in |16| . 

For this inhnite recursion, dehne 


(48) 


skn- i: EE'*-’. 

i^CUD i£D i£C 


in analogy to the dehnition of Mq As before, the s-th entry of 6^ is 
the output label of the transition starting in s with input label e. Then, 
S = ^0 0 holds independently of e. Furthermore, 
by (ISSl). 


Lemma 4.5. Let C ^ tli. For > d and C ^ the Dirichlet series 
Hc{z) satisfies the following infinite recursion 


(49) 


Q -1 

e=0 

q-1 q-1 

J2Bc(EK‘ + q-‘ Y1 'F.^Zc^dHcudO) 

6=1 £=o 

q-1 

+ q- Y, Y‘>i.Djfi-,d-\D\-\C\) 

e=0 ^ 

DCC<^m>l h / £=o 


It is analytic for Ikz > d — \C\ + 1. For \C\ = 1 and k ^ 0, wjHc 
has a possible simple pole in z = d + Xk with residue the right-hand 


side of (49) evaluated at z = d + Xk and divided by logg. For \C\ = 1, 
vofllc has a possible double pole with main part 


er 


logg {z — dfi 


2 log q) z — d 


where he is given in (52). 


Remark 4.6. The inhnite recursion (49) can be used to numerically 


compute the values of iTc and its residues at ^ d + with arbitrary 
precision. It numerically converges fast if the hrst terms of the Dirichlet 
series He are computed explicitly. 


Proof. As Be{r) = 0{r^ logr), the Dirichlet series He is analytic 
for iPz > d — \C\ F 1. 
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By multiplying (20) with u{t), differentiating with respect to t at 
t = 0 and using (34), ( [I^ and (48), we obtain the recursion 

(50) Bc{qr + £) = ^^oBcuDir) + Sc, 






for C 7 ^ 0, {1,..., d} and gr + e: > 0. By ( [It) ), this recursion is also 
valid for (7 = {1,..., d] and gr + £ > 0. 

By (50), we have 

q-l q-l 

Hc{z) = ^ Bc{e)e-^ + ^c{qr + e){qr + e)-^ 

e=l £=0 r>l 

9-1 


£ = 1 


q-l 


E Y.Y.(^‘’c.DBcuD(r) + 


r 


d-|D|-|Ch 


DCC^^ £=0 r>l 


■g-V 1 + — 

V gr 

for C 7 ^ 0. Expanding (1 + e/{qr))~^ as a binomial series yields 

q-l 

Hc{z) = Y,Bc{e)e-^ 

+ ei:ee 

DCC^^ £=0 r>l m>0 
q-l 

+ E J2^h,DJ[z,-,d-\D\-\C\ 

DCC= £=0 ^ 

q-l q-l 

= Sc(£)£-'+ 

£=1 DCC= £=0 

q-l 

+ q-’ J2^’c.DjU-,d-\D\-\C\ 


m 


MlDe^q-^-^BcvjD{r)r- 


£=0 


EE 

DCC= m>l 


— Z 

m 


e 

q'' 

q-l 


^ M^c,d^'^Hcud{z + m) 


£•=0 


for > d and C 7 ^ 0. Collecting Hc{z) on the left-hand side results 
in (49). 

To compute the residues of wjHc for \C\ = 1 at z = d + Xki note 
that X]e=o ^C9 ~ ^ holds independently of C. 
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We multiply (49) with the left eigenvector wj which results in 
(51) 


q-1 


W, 




S=1 


q-1 


+ 9 J2^c,dHcud{z 

$^DCC<^ e=0 
q-1 


DCC<^ e=0 


W, 


EE 


m 


e 

q 

q-1 


q~ 


■J2Mh,D^"^HcuDiz + m). 


DCC^m>l ^ ^ £=0 

As |C U Zl| > 2 or 3?^ + m > d, all i^cuD used on right-hand side 
of (51) are well defined for > d — 1. The Dirichlet series J have 
simple poles at z = d for ICI = 1 and D = tj) (Lemma 4.3). Thus the 
right-hand side of ( [5l| ) is meromorphic for 'Siz > d — 1 with a simple 
pole at z = d. 

The factor 1 —g‘^“^exp(^^) has a zero exactly for z = d + Xk, k G Z. 
Thus for fc 7 ^ 0, wjHc has a possible simple pole at ^ = d -|- x^. Its 
residue is the right-hand side of (51) evaluated at z = d + Xk divided 
by logg. 

If fc = 0, we have z = d. In this case the expansion of the right-hand 
side of (51) is 

er 

with 


z — d 


+ he 0{z — d) 


q-1 


(52) he = -er log g - g ^ ^c, 0 ^ (“ + [^ = 0] 


q-1 


= i]^o 


£ = 0 


£ ^—1 


C,0^ 






£ = 1 

q—1 d—2 

'^0 ^c,0 


d — 1 
k 


c\ d-l-k / P 

-) dd-K- 

qJ \ q 


£=0 k=0 

q-1 q-1 

+ 'wl'^Be{e)e-'^+ q-‘^wl ^ '^M^.j^Heunid) 

£=1 07 ^DCC‘= £=0 

q-1 

+ q-^w^ Y.^hDj{d,-,d-\D\ 

07 ^DCC= £=0 ^ 
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Wn 


I ^ \ „-d-m 


DCC= m>l 


q-1 


J2Mh,D^^HcuDid + m) 

where we used the expansion of J in Lemma 4.3, 5 = Yll=o^c9 


(14). 


□ 


From the previous lemma and (47), the residues of the Dirichlet 
function H follow. Only He with \C\ = 1 contribute as all other 
summands are holomorphic. 

Lemma 4.7. The Dirichlet function H is meromorphic in > d — 1 
with possible simple poles at z = d + Xk, k ^ 0 and a possible double 
pole at z = d. 

The residue at z = d + Xk, k is 

d / (?—1 


logg 




-d-Xk 


j=l \ £=1 
-d-Xk 


q-1 


E E ^{j},D^{j}UD{d + Xk) 

0/DC{j}c £=0 

E 12^l,i.oJid+xD-,d-\D\ 

DC{jY £=0 ^ 


E E 

DC{j}= m>l 




m 


q-1 \ 

+ m + Xk) 1 • 

£ = 0 / 


The main part at z = d is 

erd 1 
logg {z — 


e-j-d ^ h^jj 


+E 


logg/ z — d 


where h{j} is defined in (52). 

Now we can prove the formulas for the Fourier coefficients. 

Proof of Theorem The periodic fluctuation Ti of the expected value 
is a p-periodic function. We use the explicit expression of 4/i given in 
Lemma 13.91 

Due to absolute convergence, the fc-th Fourier coefficient of \I/i(a;) is 


1 n 

Ck= - 
P Jo 


4/i(a;)e 


er j . - 

= -/ Me 

P Jo 


i^^dx 

OO 

V—^ V—^ 7 27riZm 

di - E E /,,, 


2'!rik 

V 


lep m=0 
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with 






fl{{Xo. . .Xm)q)dx 


and = {xq .Xi.. .)q. The value of the hrst integral is given by — ^ 
for k = 0, and \k = 0 mod pi—^— otherwise. Thus, we focus on the 
second integral Ii^rn- 

First, we partition the interval [0,p) into intervals [r, r + 1) for r = 0, 
p — 1. After simplifying the sum of p-th roots of unity, we obtain 


Il,m — 



q '^''fiiixo . ..Xm)q)e p ""dx. 


After partitioning the interval [0,1) into the intervals [log^ r — m, 
logq(r + 1) — m) for r = g™, ..., q^+^ — 1, the function fi{{xo ... Xm)q) 
is constant on the interval of integration. Therefore, we obtain 


l£-p m=0 


—md — 


1 


(d + Xk) log q ^ 
Next, consider the function 


ft .»»=i p (’■) - (’■+■ 


.4(j) = J]] /t mod , (r) (r * - (r + 1) *). 


r=l 


We know that fi{r) = 0{r'^~^\ogr). Thus, A{z) is analytic for > 
d — 1. 

By summation by parts, we can rearrange the series for > d and 
obtain a sum of Dirichlet series 


(53) A{z) = [p I k]erSi{z)+iw'^ lS' 2 (z)-S' 3 (z) + g'^exp 


2 mk\ 


p 


)S,{z) 


with coefficients si(r), S 2 (r), S 3 (r) and S 4 (r) respectively. These coeffi¬ 
cients are differences of the four summands in mod p(f) and fk mod p(f— 
1) in (32), respectively, e.g., 

si(r) = [logg(r)J(r'^ - {q[r/q\Y) + {q[r/q\Y 

- [r > l]([logg(r- l)J((r- 1)'^- (gL(r- l)/gj)‘^) 
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After some simplifications using — [o' | f] and [logq(r — 

1)J = [log^rj — [r is a power of q\ (for r > 2), we obtain 
(54) 

si(r) = [logqrJ(r'^ - (f - 1)'^) 

- [q. I r]q'^[^og^rq-^\{{rq-^Y - (rg“^ - 1)'^) 

+ [r 7 ^ 1 is a power of q\{{r — 1)'^ — (f — qY), 

s^{r) = r‘^-{r- lY - [q \ r]q^ exp ((rg"^)'' - (rg"^ - 1)'='), 

S3(r) = wl{B^[r) - B^{r - 1)), 

S4(r) = [q I r]wl{B<i{rq-Y - B<i{rq-^ - 1)). 

For > d, we can split up the summation into the different cases 
in (54). This yields 


d-l 


SiW = (1 - 

j=0 

/'2mk\ 


- 1 ) 




d-j 


1^-3 


S,{z) = 1 


f] _2 

g exp 


V p 

Ssiz) = wjHiz) - B{z), 

S,iz) = q-^wlHiz)-q-^Biz) 


))Z{zh 


where we used (45), (47) and the Dirichlet series defined in Lemmas 4.1 
14.21 and 14.41 


Thus, in (53), we obtain 


d-l 


A(z) = [p\k]erY^ 

3=0 


(55) 


iw 


/T 


1(1 -g 


d 
J 

d—z 


- 1 ) 


d-3 


1 - 


1^-3 


(1-g' 


d—z 


e p 


-)Z(z) 


+ [p I «:]er(l -/-OiW 


+ (1 - ? 


d—z 


e p 


')B(z). 


^d—z. 


We want to evaluate A z = d + Xk- The factors 1 — g“ "e p 
are zero if and only ii z = d -\- Xk- Thus, the following Dirichlet series 
contribute to (55): 


The Dirichlet series Z only contributes if A; = 0 (Lemma 4.2). 
The Dirichlet series wjH has poles at z = d + Xk for k & "L 
(Lemma 4.7). The possible double pole at z = d cancels with 
the one of L. 
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0|0,...,g-l|g-l 



Figure 7. Transducer for the g-ary sum-of-digits function. 


The residue of the Dirichlet series L contributes to the Fourier 
coefficients (Lemma 4.1). The possible double pole at z = d 


cancels with that of w^H. 

• As the Dirichlet series B converges for > d—1 (Lemma 4.4), 
it does not contribute to the Fourier coefficients. 

As the second order poles of and L cancel, the right-hand side 


of (55) is well dehned for the limit z ^ d + Xk- After computing the 
limit and simplifying the summation, we obtain ([^. 


Then Lemma 3.12 and Bernstein’s theorem (cf. p. 240]) imply 


the absolute and uniform convergence of the Fourier series. 


□ 


Now we use Theorem]^ to prove Corollary |2.5[ 


Proof of Corollary \2.^ The transducer in Figure [^computes the q-ary 
sum-of-digits function Sq (n) and we can use Theorem 
We transform the Dirichlet series 

D{z) = ^^(sq(m) — Sq{m — 

m>l 


in two different ways. This series is absolutely convergent for ^z > 1. 

First, we can rearrange the summation of the Dirichlet series D{z) 
such that the Dirichlet series H{z) = 
appears. We have 


^m>l 


[m)m dehned in (46), 


(56) 


//(z) - 11 = O (2"*' + ^ m"*' log m) 

m>3 

= 0(2-^^+ j x-^^\ogxdx 

J 2 

= 0(2-^^) 


for > 1. By partial summation, we obtain 

D{z) = 1 — 2“^ + ^ Sq{rn)(rn~^ — (m + 1)“^) 

m>2 

= 1 — 2“^ + ^ Sg(m)m“^(l — ( 1 -I-^). 

m>2 
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Expanding the binomial series yields 

D{z) = 1 — 2“^ — ^ Sq{m)m~^ ^ 

(67) 


m>2 


1>1 


— Z 


m 


-i 


= i-2--E 


«>i 


— 2 ; 


{H{z + l)-l). 


By (57), we have 


D{z) = 1 — 2 ^ + zH{z + 1) — z — ^ 

l>2 

which is equivalent to 

H{z + 1) = -Diz) + -(2-^ - 1 ) + 1 - ^ 
z z 

l>2 


(H{z + l)-l) 


1 f-z - 1 


l \ 1-1 


{H{z + l)-l) 


for ^z > 1. The sum on the right-hand side is holomorphic at ^z = 0 
because of (56). By meromorphic continuation, this equation also holds 
for 3^2; = 0. This yields 

(58) Res 2 =i+^j^ H{z) = ReSz=Xk H{z + 1) = Res 2 =;^^ ^D{z). 

z 

On the other hand, we split up the summation in the definition of 
D{z) into the q equivalence classes modulo q and we use the recursionr® 

Sq{qm + e) = Sq{m) + e 

for Q < e < q. This results in 

Sg{m) — Sq{m — 1) = 1 -|- [g I m] (sg [q~^m) — Sq {q~^m — l) — g) 
for m > 1. Thus we obtain 

D{z) = '^[l + [q\m\ [sq {q~^m) - Sq {q~^m - l) - q)) 


m 


m>l 


= az) + q-^D{z)-q^-X{z). 

Thus, we obtaiijj 

(59) 

This formula yields 

(60) 


1 — q^ ^ 

D(z) = Y3^C(^). 


q — 1 

D{z) = -jzz::C{Xk)- 


\ogq 


For /c = 0, we further use the expansion 


C(2:) = -\-\ log(27r)^ -1- 0{z‘^) 


^Actually, these recursions are (36). 

^Note that this well-known identity can also be derived from Sq{m) — Sq{m— 1) = 
1 — (q — l)vq{m), where Vq(rn) is the g-adic valuation of m. 
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(cf. in 25.6.1 and 25.6.111) and (59) to obtain 


( 61 ) D(.) = ^ + 0 (z). 


2 z log q 


2 logg 


Thus, by (56) and (60), we obtain 


Res7= 


1 q — 1 

z=i+x, H{z) = — Res,=^, D{z) = - . CiXk) 
Xk Xk y 


for k ^ 0. For k = 0, ( |61[ ) and ( |58[ ) yield 

o Tjr \ (? - 1) log (27r) g + 1 

Res^=iif z =-—-—. 

2 log g 4 

Now, ([^ with er = ^ and w'q = 0 yields (|^. 


□ 


5. Non-Differentiability — Proof of Theorem [3] 


In this section, we give the proof of the non-differentiability of 4/i(a:). 
We follow the method presented by Tenenbaum |32], see also Grabner 
and Thuswaldner mi. 


Proof of Theorem\^ Let r = (r^-i • • • Fo)g be the value of the reset 
sequence (r^-i • • • fq) leading to state to . 

Assume that Ti is differentiable at x G [0,1). Let g^ = • • •)<? 

the standard g-ary digit expansion choosing the representation ending 
on 0^ in the case of ambiguity. Further, let Xk be such that q^^ = (^o ■ 
El... ek)q- Thus, we have hmA;^.oo Xk = x. For / G {0,1}, the function 
Lj: Z —)• Z is defined as Lf{k) = ck + f with c a positive integer such 
that C > I — 1. Define Nk = q^k+k+L}{k) _ y^ck+-^Xk-m- 2 y 

Let Hk and Zk be such that Nk + = qV^+k+Lfik) _|_ 

qCk 7Ti 1^ I k(^k^ — q^k~^k-\-L (^k'j 

From these definitions, we know that 


h{k) 


log Nk 


o{h{k)) 


for fc —)■ oo. Apart from Xk, also, yk and Zk converge to x and satisfy 
the following bounds: 


Zk - Vk 


j_MP+o 

logg Nk 


\yk - Xk\ = 0{q 

X- Xk = 0{q~^). 



Now, we compute 

^ ’ n&Nk 
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in two different ways where A4 = {n G Z | Nk + < n < 

Nk + + h{k)}. 

First, observe that | Nk and h{k) < Thus, the digit 

representations of the three summands in Nk + + n are not 

overlapping at non-zero digits for n < h{k). Since the digit expansion 
of r is a reset sequence, we have 


TiNk + + n)= eJbiNkq-^^^^^) + + n) - h{iy) 

where eJh(iV) is the output of the transducer when starting in state u 
with input N and b{u) is the hnal output at state u. 

Thus, we have 


1 


E Tin) 

n&Afk 


1 

^k) 


'T{Nk + + n) 

0<n<h{k) 


elb{Nkq-^'^^^)-h{p) + 


1 

h{k) 


r{q^^-^-h + n) 

n<h{k) 


where only the hrst summand depends on Lf{k) and hence on /. 

Taking the difference in (|^, there is a second way of computing the 
sum in ( [6^ . Using the periodicity and continuity of Ti(a;) yields 
(63) 

T{n) = {Nk + q^'^-^-^r)eT{zk - Vk) + h{k)eT{x + k + Lf{k)) 

n&Nk 

+ {Nk + - Ti(2/fc)) 

-I- h{k)^{x) + o{h{k)). 


Next, we use our assumption that Ti is differentiable at x to replace 
the difference by the derivative 

Ti(zfc) - Ti(|/fc) = '^[{x){zk - Vk) + o{\zk - a:|) -|- o{\x - yk\). 


Now, we insert this into (63), divide by h{k) and obtain 
er 


h{k) 


En 

n&Mk 


n] = 


'rer{x + k + Lf{k)) + -T^(a;)-hTi(x)-ho(l). 

log q log q 


Thus, we have the following equality 


elb{Nkq-^^^^)-h{v) + 


1 


Y T{q^’^-'^-h + n) = 

n<h(k) 


C'T 1 

--h er(T + k + Lf{k)) + - -tki(^) + ^i(^) + o(l) 

log q log q 

twice, for / G {0,1}. Subtracting these two from each other yields 
eT^(^x.+/c+2) - ej6(g"''+'=+i) = er + o(l). 
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Since the left-hand side is an integer, but the right-hand side is not for k 
large enough, this contradicts our assumption that d'l is differentiable 
at X. □ 


6. Recursions — Proof of Theorem [4] 


In this section, we construct a transducer associated to the sequence 


defined by the recursion in (10). All inequalities, maxima and minima 


in this section are considered coordinate-wise. 
Define the function A: Ng —?• Nq U {oo} by 


A{q^n + A) = 


-|- Vx if > 0, 


oo 


else 


for 0 < A < q'^1 and n > 0. So, if A{n) < oo, then the recursion (10) 


can be used for this argument because the argument on the right-hand 
side is non-negative, i.e., a(n) = a(A(n)) -|-tri mod • 

First, we construct a non-deterministic transducer T. A priori, it 
has an infinite number of states; later, we will prove that only finitely 
many of them are accessible. We then simplify it to obtain a finite, 
deterministic, subsequential, complete transducer T. 

The set of states of T is 


{(/, \lez<^,je No} u {(/, j)iv No}. 

The initial state is (0,0)ir; all states {l,j)F are final states with final 
output a(Z) if Z > 0 and final output 0 otherwis^ As an abbreviation, 
we will frequently speak about “a state if we do not want to 

distinguish between {l,j)F and We call I the carry and j the 

level of the state A state {l,j)F is called simple, if it is final, 

I > 0 and j < K. 

There are two types of transitions in T, recursion transitions and 
storing transitions. Each state is either the origin of one recursion 
transition or of storing transitions. 

There is a recursion transition leaving {l,j) if 

• j > K and 

• A{q^n + 1) < oo for all n > 0 with n ^ 0. 

In that case, we write I = q^s + A for a 0 < A < g'^1 and the transition 
leads to the state {I ' a')n with i' = nx+ i — K and l' = q^^s + rx. The 
input label is empty, the output label is tx- Thus 

( 64 ) A(q^n + 1) = q^'n + I' 


®In fact, we will prove that a path with valid input will never end in a state 
{l,j)F with 1^0, but the framework of subsequential transducers requires us to 
specify a final output even in that case. The non-final states {l,j)N will disappear 
in the reduction to T anyway. 
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for n > 0 with n 7 ^ 0. Note that (64) holds for n = 0 if and only if 
I > 0 and I' > 0 . 

Otherwise, there are storing transitions from {l,j) to {q^e + l,j + l)F 
with inpnt s and ontpnt 0 for all 0 ^ s ^ ql. 

For 


We now dehne the classes annonnced in Section [2^ 


each accessible cycle in T with simple states and inpnt 0 , the carries of 
its states form one of these classes. The other classes are the singletons 
of those carries Z > 0 in the accessible part of T with A{1) = 00 . These 
sets will tnrn ont to be disjoint by Lemma 6.6 and the hniteness of K 
will follow from the hniteness of the accessible part of T (Lemma 6.4). 


Remark 6.1. We also give a combinatorial description of those classes 
Fi, ..., Fk which do not come from cycles in T: Let Z > 0 be a carry 
of an accessible state of T. Then A{1) = cx) if and only if there is a 
recnrsion transition from some {l,j) to some with I' ^ 0 . 


Proof. Let {I, jo) be any accessible state with carry 1. We nse the 
longest path with inpnt 0 nsing storing transitions only to arrive in 
some state {l,j) —again, hniteness of this process will follow from the 
hniteness of the accessible part and the fact that the levels increase 
along storing transitions. As there is no storing transition leaving 
{l,j) by constrnction, there is a recnrsion transition from {l,j) to some 
By the remark following (64), I' = A{1) or F ^ 0. □ 


As nsnal, if reaching a state which is the origin of a transition with 
empty inpnt, the process may stay in that state or may continne to the 
destination state writing the ontpnt of the transition withont reading 
an inpnt. This is the reason why the transdncer is non-deterministic. 

Note that in onr case, transitions with empty inpnt (i.e., recnrsion 
transitions) lead to non-hnal states and transitions with non-empty 
inpnt (i.e., storing transitions) lead to hnal states. Combined with the 
fact that each state is either the origin of one recnrsion transition or of 
q'^ storing transitions, processing an inpnt is in fact deterministic: For 
every admissible inpnt—we do not allow leading zeros—, there exists 
exactly one path leading from the initial state to a hnal state with 
the given inpnt. This will enable ns to simplify the transdncer T to a 
deterministic transdncer T later on. 

We need the property that the carries of accessible states are not 
“too negative”: 


Lemma 6.2. (1) If{l,j) is an accessible state, then 

(65) q^n + / > 0 

holds for all n > 0 with n 7 ^ 0 . 

(2) If d> 2 and {l,j) is an accessible state, then 


I > 0 . 
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(3) Any accessible transition with input £ 7 ^ 0 leads to a state {l,j) 
with I > 0. 

(4) If d = 1 and {l,j) is an accessible state, then 


I > Zinin = niin< 


0 , 


-1 + ^ 
1 1 ~ 


Proof. The first assertion is easily shown by induction and (64). The 


second assertion follows by induction and from the assumption that 


Fa > 0 holds for all A. To prove the third assertion, we use (65) on the 
originating state of the transition. 

The last assertion is shown by induction. It is clearly valid in the 
initial state. For storing transitions, the value of Z is non-decreasing. 
If there is a recursion transition from some (Z,j) to some {l',j')N, we 
have 

ftA J_ rx> 


I'= I 


> q 


ka 


qr, 

L 


qK 


1 

gKA 


T Zinin I 


4-1 + 7^ 

gK gKA 

^ ' =l 


□ 


As leading zeros are not allowed, the last transition in the computa¬ 
tion path of any valid input has input s 7 ^ 0 and thus leads to a state 
with a non-negative carry. 

For our further investigations and finally the correctness proof, we 
need a suitable invariant: 


Lemma 6.3. Consider a path from {l,j) to with input label 

Em-i • • • ^ 0 ; output label dm'-i ■ ■ - do using L recursion transitions and 
n > 0. Thus m' is the number of transitions and m = m' — L is the 
number of storing transitions. 

// n 7 ^ 0 or if the last transition is a storing transition with non-zero 
input £m-i, then 

(66) A^{q^ {q'^n + {sm-i ■ ■ ■ So)q) + 1) = q^'n + T, 


and, if the recursion ( 10 ) is well-posed, 


m'-l 

(67) a{q^{q^n + {Sm-i ■ ■ ■ e^o)?) + 0 = a{q^'n + I') + 4- 

k=0 

Proof. First consider the case that the path consists of a single transi¬ 
tion. If it is a storing transition, then L = 0, m = 1, and all assertions 
follow from the definition and Lemma 6.2 On the other hand, if the 


transition is a recursion transition, we have L = 1 , m = 0 , and all 


assertions again follow from the definition. Lemma 6.2 and (64). 


By induction on the length of the path, we obtain ( 66 ) and (67). □ 
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We are now able to prove the finiteness of the accessible part. 

Lemma 6.4. The transducer has a finite number of accessible states. 

Proof. For a recursion transition from {l,j) to {I 'we have j > j'. 
Thus, there are no infinite paths consisting only of recursion transitions. 
In particular, there exist no cycles of recursion transitions. 

For d = 1, let .J > K he minimal such that > — |_^fj — 

minx q~'^^rx. Then A{q^ + 1) < oo holds for all accessible states (/,j) 
with j > J. This implies j < J for all accessible states (/,j). For 
d > 2, we have j < k =-. J for all accessible states {l,j). Thus there 
are at most J consecutive recursion transitions. 

To prove that only finitely many states are accessible, we introduce 
the notion of heights of states: The height of a state {l,j) is defined 
to be = lq~K If there exists a storing transition from (/, j) of height 
h to of height h', we have < h' < + 1. If there exists 

a recursion transition from {l,j) of height h to {I of height h', 

we have h + s~ — 1 < h' < h + s~^ where s’*" = ma,xx{r\q~^^, 0} and 
s~ = minA{nA( 3 '“'^^, 0}. 

Assume that there is a path from {l,j) of height h to {I',j') of height 
h' with L < J recursion transitions and one storing transition (in this 
order). Then we have 

-h + —is~ — 1) <h' < -h + —s^ + 1. 

q q q q 

We can subdivide every path in the transducer starting with the 
initial state into a sequence of such paths and a final path consisting 
of only recursion transitions. Let hm be the sequence of heights of the 
states where the subpaths starts. Then, we have 

— hm H-(s — 1) < hm+l < —hm H-S’*" + 1. 

q q q q 

Iteration leads to 

J{s- - 1) ^ ^ ^ Js+ + ql 

f:: 

q — 1 q — ^ 

for all m. Therefore, the height h of an accessible state is bounded. 
Since 0 < j < J is also bounded, the integer carry I = q^h of an 
accessible state {l,j) can only take finitely many different values. The 
accessible part of the transducer is thus finite. □ 

Lemma 6.5. Let V be an infinite path with input zero starting at some 
state of level j such that all of its states have non-negative carries. 
Then, after at most j transitions, it reaches a state {Iq,k). From that 
point on, it only passes through simple states, namely 

{Iq, k), {h,ji + 1)f, • • •, {h, 

{h,j2)N, {h-,j2 + 1)f, • • •, (^2, «)f, 
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(^3, is)A, {hyjs + 1)f, • • • , {h, «)f, 


where U = A{li_i) and ji = mod q- for i>l. 

Proof. Denote the first state of P by (Z,i). 

First, assume that j > k. As storing transitions always increase 
the level and the levels are bounded by Lemma 6.4, the path has to 
contain at least one recursion transition. Thus the path starts with 
k > 0 storing transitions leading from {l,j) to {l,j + k), followed by a 
recursion transition from {l,j + k) to By assumption, we have 

I > 0 and I' > 0. Thus A^l) = I' ^ oo by (64). Therefore, there is 
a recursion transition leaving (i,i), i.e., there were no leading storing 
transitions. Recall that j' < j holds for any recursion transition. We 
repeat the argument at most j — k times until we reach a simple state. 

If we are in a simple state {I',j') with j' < k, the next k — j' steps 
will be storing transitions, leading to (F, k). This means that after at 
most j steps, we reach a state {Iq, a). 

We now apply the argument of the second paragraph again. Thus a 
recursion transition leads to (^i, ji) with li = A{Iq) and ji = kiq modg-^- 
The remainder of the lemma follows by induction. □ 


As an auxiliary structure for deciding the well-posedness of the re¬ 
cursion, we introduce the recursion digraph TZ. It has set of vertices Nq 
and arcs (n, A(n)) with label frimodq'' for all n G Nq with A{n) < oo. 
Thus a{n) can be computed from the successor of n in 7^ using the 
recursion (10). By dehnition, each vertex of TZ has out-degree 1 or 0. 
Each component of 7?. is a functional digraph or a rooted tree (oriented 
towards the root). 

If 

II II IIA||qo T ||t^||oo 


n 


> max 

A 


— q 




we have 


q ||n| 


- IIAI 


> q 


KX I 


n\ 


+ TA 


and therefore 

||g'"n-h A||oo > + ta||oo 

for all 0 < A < g'^1. Thus we have ||n.'||oo < Halloo for all but hnitely 
many arcs (n, n') of TZ. 

Thus for every vertex of TZ, there is a unique path starting in this 
vertex and leading to a vertex with out-degree 0 or a hnite cycle. 

From this description, it is clear that the recursion is well-posed if 
and only if 


• the sum of the labels of each cycle in 7?. is 0 and 

• the set X consists of one element for every cycle in TZ as well as 
of the vertices with out-degree 0 in TZ. 
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We now prove the essential connection between the recursive digraph 
and the transducer T. This also implies that the classes Fi, ...,Fk 
are disjoint. 

Lemma 6.6. There exists a bijection between cycles in the recursive 
digraph IZ and accessible cycles in the transducer T with input 0 and 
simple states. Corresponding cycles under this bijection have the same 
output sum and sum of labels. 

Proof. Let Uq, ..., til = no he a cycle in the recursive digraph with 
> 0 for all 0 < i? < L. ^ 

Let ko be the length of the path Vq in T starting in the initial state 
and reading the g-ary expansion of no- 

We determine the destinations of certain paths in the transducer 
associated with the cycle in the recursive digraph. 


Statement 6.7. Let k > ko and V be the path from the initial state 
(0,0) to {l,j) of length k whose input label is the q-ary expansion of 
no, padded with leading zeros. Assume that the number of recursion 
transitions in this path is LQ + R for some Q > 0 and 0 < R < L. 
Then I = n^ > 0. 


Proof of Statement 6. 7, Let k' = k — [LQ+R) be the number of storing 
transitions of V. By (66), we have 

( 68 ) 

for n > 0, n 7 ^ 0. 


Note that for M > k and n = n' (mod q^) with A{n) < oo and 
A(n') < oo, the definition of A implies A{n) = A{n') (mod q^~'^). 
Together with the definitions of and the recursive digraph TZ as 


well as (68), this implies 

riR = A^^^^{no) = + no) 

= + I (mod 

for sufficiently large M. Coarsening yields 
nn = I (mod 

still valid for sufficiently large M. As I is bounded by Lemma 6.4, this 
implies n^ = 1. □ 


Now, we conclude the proof of Lemma 6.6 


Let V be the infinite path in T starting at the destination of Vo and 
reading zeros. By Lemma [6A] applied to V together with Statement 6T 


applied to Vo concatenated with prefixes of V, V leads to a cycle in 
T. Its states are simple and have carries no, ..., ni_i and levels 
determined by no, ..., n^^i as in Lemma 6.5 


This construction defines a map from the cycles of the recursive 
digraph TZ to the accessible cycles with input 0 in the transducer with 
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simple states. This map is injective by construction. Under this map, 
the sum of the labels of the cycle in TZ equals the sum of output labels 
of the cycle in T by construction. 

On the other hand, let 

(no, jo), (no Jo + 1),..., (no, k), 

(ni,ji),(ni,ji + l),...,(ni,K),... 

(nL_i, ji-i), {til-iJl-i + 1),..., (riL-i, k), 

(no, jo) 


be an accessible cycle of simple states in the transducer with input 0. 
Lemma 6.5 yields = ni^+imodL > 0 for 0 < i? < L. Thus, this 

cycle in the transducer is the image of the cycle Uq, ..., til = tiq in 
the recursive digraph. Thus the map is surjective. □ 

To use Theorem we simplify T to obtain the deterministic trans¬ 
ducer T, that is one without transitions with empty input. As a first 
step, we remove all non-accessible states. By Lemma 6.4, this leaves 
us with hnitely many states. 

By Lemma 6^ and the fact that recursion transitions decrease the 
level, the length of paths consisting of recursion transitions only is 
bounded. As a recursion transition always leads to a non-hnal state, 
processing an input never ends with a recursion transition. 

Consider a recursion transition from {l,j) to {I' with output 
t such that no recursion transition originates in (I ' ij')N- For each 
transition originating in say to some {1"J")f with input e 

and output F, we insert a storing transition from (/, j) to {I",j")F with 
input £ and output t + t'. Then, the recursion transition from {IJ) 
to {l',j')N is removed. The number of recursion transitions decreased 
by one and the new transducer generates the same output as the old 
transducer. We repeat this process until there are no more recursion 
transitions. Then, all non-hnal states are inaccessible and are removed. 


Proof of Theorem By Lemma |6.6 and the characterization of well- 
posedness via the recursive digraph, the recursion (10) is well-posed if 
and only if X consists of exactly one representative of each of the sets 
A), 1 < j < K, and if T has no cycle with simple states, input 0 and 
non-vanishing output sum. 

We now show that the cycles of simple states with input 0 in T 
are exactly the reductions of the cycles of simple states with input 0 
in T. As a cycle with simple states and input 0 in T does not have 
consecutive recursion transitions (cf. Lemma 6.5), it is reduced to a 
cycle with simple states in T. On the other hand, consider a cycle of 
T with input 0 containing a non-simple state. If there is a state of 
level > K, the state with largest level is hnal and is not removed. If 
all states have level < k, then there are no two consecutive recursion 
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transitions, so no negative carry is completely removed from the cycle 
in the rednction to T. Therefore, snch a cycle is not rednced to a cycle 
with simple states and inpnt 0 in T. 

Therefore, the assertion on well-posedness is proved. 

To prove correctness of the transdncer, we nse (66) with (/,j) = 
(0,0), the joint g-ary expansion of n as inpnt leading to some state 
{I',j')p with ontpnt 6m'-i ... 5o- By Lemma 6.2, we have I' > 0 because 
the last transition is a storing transition with non-zero input. Thus 
by (67), a(n) = a{l') + ^k- As the final output of {l',j')p is 

defined to be a{l'), we obtain T(n) = a{l ') + Er=o' 4 = a{n) , as 
requested. □ 
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